Monte Carlo Noisy HMM Estimation and Segmental Differential Features on the Aurora2 Clean Training Evaluation

نویسندگان

  • Jing-Teng Zeng
  • Cheng-Chang Lee
  • Jeng-Shien Lin
  • Yuan-Fu Liao
چکیده

In this paper, the compensation of mismatch between clean training hidden Markov models (HMMs) and noisy test speech is addressed. The purpose is to approach the performance of Aurora2 multi-condition training but use only clean training material. The idea is to integrate three methods including (1) mean subtraction, variance normalization and ARMA filtering (MVA) post-processing for Mel-scaled cepstral coefficients (MFCCs) normalization, (2) Monte Carlo noisy HMM estimation by adding artificial noises in the linear mel-scale filterbank parameter (MELSPEC) domain and (3) novel segmental differential features for increasing recognizer’s discriminative power. Experimental results on Aurora2 clean training corpus have shown that great performance improvement was achieved. Especially, although only clean training material was used, the performance did close to the level of Aurora2 multi-condition training.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Robust Speech Recognition Using Generalized Distillation Framework

In this paper, we propose a noise robust speech recognition system built using generalized distillation framework. It is assumed that during training, in addition to the training data, some kind of ”privileged” information is available and can be used to guide the training process. This allows to obtain a system which at test time outperforms those built on regular training data alone. In the c...

متن کامل

Evaluation of noisy speech recognition based on noise reduction and acoustic model adaptation on the Aurora2 tasks

In this paper, we have evaluated a noisy speech recognition method based on noise reduction and acoustic model adaptation, on the AURORA2 tasks. For noise reduction method, we employed two noise reduction methods. One is an Adaptive Sub-Band Spectral Subtraction (ASBSS) method which can vary noise subtraction rate according to SNR in frequency bands at each frame. The other is a Kalman filterin...

متن کامل

HMM modelling of additive noise in the western languages context

This paper is concerned to the noisy speech HMM modelling when the noise is additive, speech independent and the spectral analysis is based on subbands. The internal distributions of the noisy speech HMM’s were derived when Gaussian mixture density distributions for clean speech HMM modelling are used, and the noise is normally distributed and additive in the time domain. In these circumstances...

متن کامل

Recursive estimation of nonstationary noise using iterative stochastic approximation for robust speech recognition

We describe a novel algorithm for recursive estimation of nonstationary acoustic noise which corrupts clean speech, and a successful application of the algorithm in the speech feature enhancement framework of noise-normalized SPLICE for robust speech recognition. The noise estimation algorithm makes use of a nonlinear model of the acoustic environment in the cepstral domain. Central to the algo...

متن کامل

Speech Enhancement Based on Snr-dependent Empirical Statistical Estimation in Log-spectral Magnitude Domain

We present a data-driven speech enhancement system based on empirical statistical estimations of speech in the log-spectral magnitude domain, where the enhancement filter is trained at each SNR index. We use a estimation method called SNRGMM, which were developed in our previous work, to cluster the training data and learn the enhancement filter at each SNR index. This measurement is later used...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006